Attorney Docket: DMJ 2-002 



CLAIMS 

1. A method for evaluating the text content of a document database with
respect to a document population, comprising the steps of: 

(a) providing a computer system having a user interface with a 

display; 

(b) gathering documents from said database into said system;

(c) normalizing said gathered documents; 

(d) fingerprinting said gathered documents; 

(e) determining a text criteria with respect to said document 

population; 

(f) forming a net comprising at least two nodes associated by at

least one interaction and displayable at said display as two or more spaced apart 
nodes connected by an interaction; 

(g) loading said text criteria into at least one of said nodes; 

(h) for each document of said database, calculating its geometric
relative distance from a said node to derive one or more node attractors;

(i) displaying said net at said display in combination with one or 
more document symbols each representing a said document located in 
correspondence with said calculated relative distance; 

(j) visually examining the display of said net and document 

symbols; and

(k) determining from said document symbol locations at said 
display those documents, if any, which are more likely to correspond with said text 
criteria. 
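
Steps (g) through (i) of claim 1 can be illustrated with a minimal sketch. The claims do not define the fingerprint or the distance measure, so everything here is an assumption made for illustration: `fingerprint` is a hypothetical bag-of-words count, `cosine` is a swapped-in similarity measure, and `place_on_net` is a hypothetical helper that positions a document symbol on the line joining a positive node and a null node.

```python
import math
from collections import Counter


def fingerprint(text):
    """Hypothetical fingerprint: lower-cased word counts (an assumption)."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two word-count fingerprints."""
    dot = sum(a[w] * b[w] for w in a if w in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * \
        math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def place_on_net(doc_text, criteria_text,
                 positive_xy=(0.0, 0.0), null_xy=(1.0, 0.0)):
    """Return (x, y) for a document symbol between the two nodes.

    A document identical to the criteria lands at the positive node;
    a document sharing no words with it lands at the null node.
    """
    similarity = cosine(fingerprint(doc_text), fingerprint(criteria_text))
    t = 1.0 - similarity  # assumed relative distance from the positive node
    x = positive_xy[0] + t * (null_xy[0] - positive_xy[0])
    y = positive_xy[1] + t * (null_xy[1] - positive_xy[1])
    return (x, y)
```

Under these assumptions, symbols clustered near the positive node would mark the documents more likely to correspond with the text criteria, which is the visual determination described in steps (j) and (k).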

2. The method of claim 1 in which:

said step (f) forming a net provides for the display of said net as 
having said nodes defined as circles and said interaction defined as a line extending 
between said circles. 

3. The method of claim 1 in which:

said step (g) loads said text criteria into a positive designated one of 

said nodes.

4. The method of claim 1 in which:

said step (f) forms said net as comprising a said positive designated 
node and a null designated node connected by a said interaction. 

5. The method of claim 1 in which:

said step (e) determines said 
text criteria as criteria document textual material; and 
said step (g) comprises the steps: 

(g1) normalizing said criteria document textual material; and
(g2) fingerprinting the normalized criteria document textual material.

6. The method of claim 1 in which: 

said step (e) determines a positive text criteria and a negative text 

criteria with respect to said document population; 
said step (f) forms a net comprising one or more positive designated

nodes, one or more negative designated nodes and one or more interactions; 

said step (g) loads said positive text criteria into said one or more 

positive designated nodes, and said negative text criteria into said one or more 

negative designated nodes; and 
said step (h) calculates, for each document of said database, its

geometric relative distance from both said positive designated node and said negative 

designated node. 

7. The method of claim 1 in which: 

said step (i) displays said one or more document symbols as squares.

8. The method of claim 1 including the steps: 

(l) retrieving the identification of those documents resulting from
the determination of step (k);
(m) viewing one or more of the documents identified in step (l) and

determining the quality of the match thereof with said step (e) text criteria. 

9. The method of claim 8 further comprising the steps:

(n) identifying a new text criteria as a result of a said step (m)
determination of an insufficient said quality of said match; 

(o) adding the identified new text criteria to the step (g) text criteria 
loaded into said positive designated one of said nodes; and 
(p) reiterating said steps (h) through (m).

10. The method of claim 8 further comprising the steps: 

(q) subsequent to said step (m), identifying and viewing at said 
display a list of features common to those documents the identification of which was 
retrieved in step (l);

(r) identifying a new text criteria in correspondence with said step 
(q) identification and viewing at said display of said features common to those 
documents the identification of which was retrieved in step (l);

(s) adding the identified new text criteria to the step (g) text criteria
loaded into said positive designated one of said nodes; and

(t) reiterating said steps (h) through step (m). 

11. The method of claim 1 in which:

said step (k) further comprises the steps: 
(k1) determining additional text criteria where said document symbol

locations are not likely to correspond with said text criteria determined at step (e); and

(k2) adding said additional text criteria to said text criteria 
determined at said step (e). 

12. The method of claim 8 in which:

said step (l) is carried out by drawing at said display of said net a
boundary defining a region of said symbols. 

13. A method for evaluating the text content of a document database with 
respect to a population of documents, comprising the steps of:

(a) providing a computer system having a user interface with a 

display;

(b) forming one or more nets each comprising at least two nodes
associated by at least one interaction, one or more said nodes representing an
evaluation criteria, said one or more nets being viewable at said display;

(c) treating said documents to have an attribute value and
calculating for each document a geometric relative distance with respect to a said
node criteria and displaying corresponding document symbols at said display;

14. The method of claim 13 in which:

said step (d) provides a said first region extending over more than one
said net; and includes the step:

(d1) mapping said first region to a first document set by selecting
the union or intersection of documents on different nets.

15. The method of claim 13 in which:

said step (e) provides a said second region extending over more
than one said net; and including the step:

(e1) mapping said second region to a second document set by
selecting the union or intersection of documents on different nets.

16. The method of claim 13 in which:

said step (f) selects said criteria for establishing an attribute value
match by defining an attribute value tolerance.

17. The method of claim 16 in which:

said step (g) determines the presence of a document attribute match
pair by determining whether the attribute value of a document in said first region is 
equal to the attribute value of a document in said second region within said attribute 
value tolerance. 
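
The tolerance test of claims 16 and 17 can be sketched as follows. This is an illustration only: `match_pairs` is a hypothetical helper, each region is assumed to be a list of `(document_id, attribute_value)` tuples with numeric attribute values, and "equal within said attribute value tolerance" is read here as an absolute difference no greater than the tolerance.

```python
def match_pairs(first_region, second_region, tolerance):
    """Return (id_a, id_b) pairs whose attribute values agree within tolerance.

    first_region and second_region are lists of (document_id, attribute_value)
    tuples; this data layout is an assumption made for illustration.
    """
    pairs = []
    for id_a, value_a in first_region:
        for id_b, value_b in second_region:
            # Assumed reading of "equal within the tolerance".
            if abs(value_a - value_b) <= tolerance:
                pairs.append((id_a, id_b))
    return pairs


# Example with a hypothetical "year" attribute and a tolerance of one year.
first = [("d1", 1998), ("d2", 2005)]
second = [("d3", 1999), ("d4", 2010)]
pairs = match_pairs(first, second, tolerance=1)
```

With a tolerance of zero the same sketch reduces to an exact attribute match.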

18. The method of claim 13 in which:

said step (d) is carried out by providing a computer generated line or 
lines visible at said display. 

19. The method of claim 13 in which:

said step (e) is carried out by providing a computer generated line or 
lines visible at said display. 



20. The method of claim 13 in which:

said step (h) is carried out by providing a visible line at said display

connecting two said symbols and representing said correlation.

21. The method of claim 13 in which:

said step (f) selects said document attribute or document identification; 

and

said step (g) identifies the same document in each said first and 
second region as a said correlation. 



22. A method for searching the text content of a document database with 
respect to a population of documents, comprising the steps of:

(a) providing a computer system having a user interface with a 

display; 

(b) identifying the population of documents to be searched;

(c) normalizing the documents of the identified population with the
steps comprising:

(c1) selecting character sequences that will separate words,

(c2) determining to either retain or eliminate punctuation characters,

(c3) setting regular expressions that will characterize numbers,

(c4) setting case behavior,

(c5) setting an offset and factor for numeric class,

(c6) converting a document of said identified population to a 
character sequence, 

(c7) accessing the words, or punctuation characters, W of said 
character sequences,

(c8) for each accessed W which is a number, converting such 
number into a sequence of word numbers, WN, and normalizing said word numbers 
for fingerprinting, 

(c9) marking the position and length of each W or normalized word 

number WN,

(c10) for each W or normalized WN, completing said normalization by
reiterating steps (c8) and (c9); 

(d) fingerprinting said normalized documents; 

(e) forming one or more nets, each comprising at least two nodes,
one or more said nodes representing an evaluation criteria, said one or more nets

exhibiting two or more spaced apart nodes connected by one or more interactions; 

(f ) for each normalized document, calculating its geometric relative 
distance from a said node; 

(g) displaying said one or more nets at said display in combination 
with one or more document symbols representing a said document located in

correspondence with said calculated relative distance; and 

(h) determining from said document symbol locations at said display, if
any, those documents which are more likely to correspond with said evaluation 
criteria. 
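
The normalization settings of steps (c1) through (c4) and the position marking of step (c9) can be sketched as below. This is a minimal illustration under stated assumptions, not the claimed implementation: `normalize` and its parameter names are hypothetical, the default separator and number patterns are guesses, and the numeric offset and factor of step (c5) and the word-number conversion of step (c8) are omitted.

```python
import re


def normalize(text, separators=r"\s+", keep_punctuation=False,
              number_pattern=r"\d+(?:\.\d+)?", lowercase=True):
    """Return (kind, value, position, length) tuples for each word W.

    separators       -> (c1) character sequences that separate words
    keep_punctuation -> (c2) retain or eliminate punctuation characters
    number_pattern   -> (c3) regular expression that characterizes numbers
    lowercase        -> (c4) case behavior
    position, length -> (c9) location of each W in the character sequence
    """
    token_re = re.compile(
        rf"(?P<num>{number_pattern})|(?P<word>\w+)|(?P<punct>[^\w\s])")
    tokens = []
    start = 0
    # Locate every separator, then add a sentinel for the end of the text.
    bounds = [(m.start(), m.end()) for m in re.finditer(separators, text)]
    bounds.append((len(text), len(text)))
    for sep_start, sep_end in bounds:
        chunk = text[start:sep_start]
        for m in token_re.finditer(chunk):
            kind = m.lastgroup
            if kind == "punct" and not keep_punctuation:
                continue
            value = m.group()
            if kind == "word" and lowercase:
                value = value.lower()
            tokens.append((kind, value, start + m.start(), len(m.group())))
        start = sep_end
    return tokens


tokens = normalize("Paid 3.5 USD, twice!")
```

Recording the position and length against the original character sequence is what would let a later step point back into the source document, for example to highlight the matched words.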


23. The method of claim 22 in which said step (c8) further comprises the 

steps: 

(c8.1) convert any date characterized word number WN to a float or

integer, 

(c8.2) applying an offset and factor to the word number WN to derive

X,

(c8.3) set the range, R,

(c8.4) calculate the quantity T = (log10 X)/R,

(c8.5) successively decrementing the value of range, R and

calculating the quantity, T until R is equal to zero, 

(c8.6) designating S as the position of a significant numeral in X, 
(c8.7) assigning each successive quantity T to a corresponding 

successive position S to derive the first component of normalized word number WN;

and 

(c8.8) subsequent to said step (c8.7), assigning each successive
said numeral in X to a corresponding successive position S to derive a second 
component of said normalized word number, WN. 

24. The method of claim 23 in which: 

said step (c8.3) further comprises the step: (c8.3.1) setting the 
precision P of the normalized word number WN, 

said step (c8.8) is carried out until the number of said successive 
positions S deriving said second component equals the value of said precision, P.